# Low-latency speech generation
Kimi Audio 7B
MIT
Kimi-Audio is an open-source foundational audio model that excels in audio understanding, generation, and dialogue.
Speech Recognition Supports Multiple Languages
K
moonshotai
55
15
Seamless M4t V2 Large
SeamlessM4T v2 is a large-scale multilingual multimodal machine translation model released by Facebook, supporting speech and text translation for nearly 100 languages.
Text-to-Audio
Transformers Supports Multiple Languages

S
facebook
64.59k
821
Featured Recommended AI Models